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We show that the steady state entropy production rate of a stochastic process is inversely pro¬ 
portional to the minimal time needed to decide on the direction of the arrow of time. Here we 
apply Wald’s sequential probability ratio test to optimally decide on the direction of time’s ar¬ 
row in stationary Markov processes. Furthermore the steady state entropy production rate can be 
estimated using mean first-passage times of suitable physical variables. We derive a first-passage 
time fluctuation theorem which implies that the decision time distributions for correct and wrong 
decisions are equal. Our results are illustrated by numerical simulations of two simple examples of 
nonequilibrium processes. 
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Processes that take place far from thermodynamic 
equilibrium are in general irreversible and are associated 
with entropy production. Irreversibility implies that a 
sequence of events that takes place during a process oc¬ 
curs with different probability than the same sequence 
in time-reversed order. Irreversibility and the thermo¬ 
dynamic arrow of time can be illustrated considering a 
movie displaying the evolution of a complex dynamic pro¬ 
cess. Such a movie can be run either forward in time or in 
reverse. For an irreversible process it is possible to decide 
whether the movie is run forward or in reverse defining 
the direction of the arrow of time by the direction in 
which entropy increases on average [T]. For a system 
at thermodynamic equilibrium, however, even though all 
atoms or molecules move rapidly in all directions, it is 
impossible when watching a movie to tell whether it runs 
forward or in reverse. This raises the following ques¬ 
tion: Can the time rjec needed to decide between two 
hypotheses (movie run forward or in reverse) be related 
quantitatively to the degree of irreversibility and the rate 
of entropy production? 

Decision theory provides a general theoretical frame¬ 
work to optimally make decisions based on observations 
of stochastic processes |5]. An important question of de¬ 
cision theory is what is the earliest time to make a de¬ 
cision d between two competing hypothesis Hi and Hq 
with a given reliability, while observing a stochastic pro¬ 
cess. In 1943, A. Wald made a pioneering contribution 
to this problem by introducing the sequential probabil¬ 
ity ratio test (SPRT) [3], which provides the minimal 
mean decision time for a broad class of stochastic pro¬ 
cesses |4]. Wald’s SPRT states that the decision d = 1 
{d = 0) should be made when the cumulated logarithm 
of the likelihood ratio C{t) for the first time exceeds 



FIG. 1. Log-likelihood ratio of the sequential probability ra¬ 
tio test in the arrow of time as a fnnction of time in a drift- 
diffusion process with diffusion coefficient D — 0.52/im^/s 
and drift velocity v = 65/rm/s. The simulation time step is 
At = 0.1ms. The thresholds of the test are shown as hor¬ 
izontal lines for symmetric error probabilities equal to 20% 
(red) and 1% (blue). The thresholds Lo and Li correspond 
to the choice of one of the two hypotheses: the sequence runs 
forwards (Li) or backwards (Lo) in time. With 20% error 
probability, the decision is made faster (red circle and verti¬ 
cal red dashed line) than for 1% error probability (blue circle 
and vertical blue dashed line). 


(falls below) a prescribed threshold Li (Lq) (see Fig. 
[^. The thresholds Li and Lq are determined by the 
maximally allowed probabilities to make a wrong deci¬ 
sion Q!i = P{d = l|iLo) and oq = P{d = 0\Hi). Here, ai 
(ao) is the probability to incorrectly make the decision 
d = 1 (d = 0) when the hypothesis Hq (Hi) is true. 
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In this Letter, we derive a general relation between 
the average entropy production rate in a nonequilibrium 
steady state and the mean time to decide whether a sta¬ 
tionary stochastic process runs forwards {Hi) or back¬ 
wards in time (iLo) using the SPRT. Furthermore, we in¬ 
troduce a fluctuation theorem for the first-passage time 
probability distribution of the total entropy changes and 
obtain a fluctuation theorem for the decision time distri¬ 
bution of the SPRT in the arrow of time. Our work re¬ 
veals that entropy production can be estimated measur¬ 
ing first-passage times of stationary stochastic processes. 

We consider a physical system in a nonequilibrium 
steady state. We denote by Xt = {W(s)}*^o ^ path de¬ 
scribing the evolution of a state as a function of time 
t. We denote by Xt the time-reversed path Xt = 
{X(t—s)}g^g [S]. The state of the system is characterized 
by the path probability P{Xt). The entropy production 
associated with the path Xt can be defined as [S] 

AStot[Xt]=k\n^^ , ( 1 ) 

where k is Boltzmann’s constant. We now perform a 
SPRT of two hypotheses. Given a path Xt, we want 
to decide whether it corresponds to a forward or time- 
reversed trajectory of the nonequilibrium steady state. 
We therefore consider the hypothesis = Hi that 
the path runs forward in time with the conditional 
probability P{Xt\H^) = P{Xt)', and the hypothesis 
= iLg that the dynamics is time reversed, for which 
P{Xt\Hj^) = P{Xt). Using the SPRT, the decision is 
made when the log-likelihood ratio or Turing’s weight of 
evidence |7] 


C{t) = In 


P{Xt\H^) 

P{Xt\H^) 


( 2 ) 


reaches for the first time one of the thresholds Li = L and 
Lg = —L, where we have chosen for simplicity a SPRT 
with symmetric decision error probabilities ag = ai = a. 
When C{t) is continuous, we have L = ln[(l — a)/a] [3]- 
The entropy production and the log-likelihood ratio 
are related by 


C{t) = . (3) 

This provides a connection between decision theory and 
stochastic thermodynamics. Moreover, it allows us to 
obtain relations between average decision times in the 
SPRT and the average rate of entropy production. Ap¬ 
plying the SPRT to continuous time Markov processes 
(See Supplemental Material and Ref. [HI), we show 
that the mean decision time for a stochastic process with 
continuous C{t) is given by 


('Flee) 


L(1 — 2a) + {Cex)dec 

(dC/dt) 


( 4 ) 


Here (...) denotes an ensemble average in steady state. 
An average over the ensemble which starts from an initial 
distribution of states that equals the distribution of states 
at the decision times is denoted by (... )dec- The excess 
log-likelihood ratio £ex is defined as 


C 


ex 



dt' 


( 5 ) 


The mean decision time of the SPRT for independent 
identically distributed (i.i.d.) observations is a special 
case of Eq. (Q for which (£ex)dec = 0. This is because 
for an i.i.d. process the state distribution is identical to 
the stationary distribution. 

We now apply the theory of decision times to the SPRT 
on the arrow of time of a nonequilibrium process. The 
relation (i together with our Eq. (@ describing the av¬ 
erage decision time can be used to express the average 
entropy production rate in steady state as 


1 / dgtot \ _ L{1 - 2a) -I- (A5'ex)dec/fc 
k\ dt / (rdec) 


where 


AW 


nOO 

rd5tot 

/d5tot\l 

L 

dt' 

\ dP /J 


dt' 


( 6 ) 

( 7 ) 


denotes the excess total entropy change. 

In the limit of small a, the mean decision time becomes 
large, (AS'ex)dec/('rdec) becomes small and, thus, Eq. ^ 
simplifies to 


1 /d^\ _ L{l-2a) 
k\ dt / ~ (rdec) 


Equations (6) and (8) show that the minimal average 
time needed to decide whether a process runs forward 
or backward in time is inversely proportional to the av¬ 
erage entropy production rate. Approaching thermody¬ 
namic equilibrium, the mean decision time diverges be¬ 
cause L(t) — 0 in this limit. If the reliability of the 
decision is increased, decision times increase correspond¬ 
ingly. Because the average entropy production rate is a 
property of the process only and not of the SPRT, the ra¬ 
tio given in the right hand side of is thus independent 
of the error probability a. 

Making decisions in the arrow of time provides a novel 
way to estimate the entropy production rate of nonequi¬ 
librium Markovian processes. Estimators for the steady 
state entropy production rate can be obtained from the 
first-passage times r of a suitable physical observable 
r(At). If we use the first-passage of a physical observable 
through a threshold value to decide on the arrow of time 
and if this decision has an error probability a, then it fol¬ 
lows from the optimality of the SPRT that (r) > (fjbc), 
i.e., the mean first-passage time (r) is larger or equal to 
the mean decision time of the SPRT given in Eq. (8). 
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The resulting estimator of the entropy production pro¬ 
vides a lower bound to the exact value: 

where D[-^ || ^] = D[P{d\H^)\\P{d\H^)] = ln[(l — 
a)/a]{l — 2a) is the Kullback-Leibler divergence be¬ 
tween the conditional probabilities of the decision vari¬ 
able. When decisions in the direction of time are made 
based on the log-likelihood ratio, i.e. when T = AStot/k, 
Eq. (9) becomes an equality. 

The stochastic nature of decision making in the arrow 
of time can be characterized by the probability density 
^’(Tdec) of making a decision at time Tdec- The connec¬ 
tion between decision theory and thermodynamics im¬ 
plies a relation between the decision time distribution 
and the distribution of entropy production AS'tot- For 
Markovian processes, the probability density P{AStot',t) 
of entropy production AS'tot during the time interval 
t is related by a fluctuation theorem to the proba¬ 
bility density to reduce entropy by the same amount: 
P{AStout)/P{-AStot',t) = exp(AStot/fc) [IMIII- In 
addition, we find that the probability distribution of the 
first-passage time r of entropy production also obeys the 
following detailed fluctuation theorem if the transition 
probabilities are translationally invariant |15| (see Sup¬ 
plemental Material 0): 

P(r;AStot) . (10) 

P{t; -AStot) 



FIG. 2. Estimation of the steady state entropy production 
rate of a drift-diffusion process with periodic boundary con¬ 
ditions (e.g., particle in a ring, see inset) by a sequential 
probability ratio test in the arrow of time. The estimator 
L{1 — 2a)/{Tdec) is shown as a function of the error prob¬ 
ability a for different simulation time steps At/r^ and nor¬ 
malized by Tc = l/[(dS'tot/dt)/A:] = D/v^. For the vertical 
axis we use the empirical mean of Tdec and L is the threshold 
for the decision. The data is obtained from 1000 numerical 
simulations with drift velocity u = 65 /rm/s and diffusion co¬ 
efficient D = 0.52/rm^/s. The horizontal line corresponds to 
the steady state entropy production rate. 

P(rdecjd) = P{Tdec\d)P{d), we then find that the con¬ 
ditional probability densities for the decision time obey 

T’(T'dec| ~t) = P(T<iec| ■i~) ■ (13) 


Here, P{t; A5'tot)dT denotes the probability to reach the 
value AS'tot for the first time in the time interval [r, r -|- 
dr] given that the entropy production has not reached 
—AStot before. 

The relation between entropy production and the log- 
likelihood ratio it together with the first-passage time 
fluctuation theorem implies for the SPRT in the ar¬ 
row of time 


Pi'^dec: P) 
Pij'dec 5 P) 


exp (L) 


( 11 ) 


Here, P(Tdec; L) is the probability distribution of the 
decision time of the SPRT for a given error rate a. 
P(rdec; L) is also the distribution of first-passage times to 
reach the threshold L for the first time without reaching 
the threshold —L before, given is true. The probabil¬ 


ity distributions in (111 are equal to the joint probability 
densities to make a decision d G } at time Tdeo 

P(Tlec; 0 P(Tdec?T), and P('7dec,’^ ) P(Tiec? P) ■ 


Equation (111 thus implies 


P(pieC7 ^) 
P(Tiec, ^ ) 


= exp (L) 


( 12 ) 


From Eq. (12| it follows that P{d =^)/P{d =•«—) = 


exp(P), consistent with previous results obtained for two¬ 
boundary first-passage time processes Hi IT]. Using 


This implies that even though decisions are made with 
different probabilities, the conditional decision time dis¬ 
tributions have the same shape for both outcomes. We 
therefore call Eq. (131 the Fluctuation Theorem in the 
Arrow of Time (FTAT). Equations ([^ and (131 are the 
main results of this paper. 

To illustrate how Eq. ^ provides an estimator for 
the steady state entropy production rate, we discuss two 
paradigmatic examples of nonequilibrium stochastic pro¬ 
cesses. We first consider a drift-diffusion process with 
periodic boundary conditions of a particle with position 
x{t), average drift velocity v, and diffusion coefficient D. 
If Einstein’s relation holds, D = kT/j where 7 is a fric¬ 
tion coefficient, the steady state entropy production rate 
is (dS'tot/df)/fc = v'^/D = F^/{jkT), where F = jv is 
the friction force and T is the temperature of the thermal 
bath |TH|. Figure|^shows L(l —2a)/(Tdec) obtained from 
1000 numerical simulations of the SPRT in the arrow of 
time (markers) as a function of the error probability a for 
different values of the simulation time step At together 
with (d5'tot/df)/fc (blue solid line). For the drift-diffusion 
process, the log-likelihood ratio for the SPRT in the ar¬ 
row of time is simply given by C{t) = {v/D)[x{t) — a:(0)]. 
As long as the simulation time step obeys At Tc, where 
Tc = /c/(dS'tot/dt) = D/v'^, the SPRT in the arrow of 
time provides an accurate estimator of entropy produc- 
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FIG. 3. Conditional distributions of the decision time 
4’('rdec| —>■) and P(rdec| <—) obtained from 10® numerical 
simulations of a drift diffusion process with diffusion D = 
0.52 ^m^/s and drift v = 65/rm/s. The simulation time step 
is At = 0.1ms and the error probability a = 0.01. 

tion independent of the error probability a. For larger 
At, the estimator is only accurate for small a but pro¬ 
vides a lower bound to the steady state entropy produc¬ 
tion rate for larger a. In our simulations, we also cal¬ 
culated empirical conditional decision time probabilities 
^’('rdecl -t) (green) and P(Tdec| t-) (purple), which are 
shown in Fig. 3 for a = 0.01. Figure 3 demonstrates the 
validity of the FTAT given in (HI for the drift-diffusion 
process. 

The drift-diffusion process is a simple example and 
serves as an illustration of our results. We now test 
whether our results also hold in more complex nonequi¬ 
librium stochastic processes that involve discontinuous 
jumps of the state variables. We therefore discuss the 
SPRT in the arrow of time for the case of a flashing 
ratchet with periodic boundary conditions. We con¬ 
sider a Brownian particle with diffusion coefficient D, 
subject to a piecewise linear periodic potential that is 
switched on and off stochastically at a constant rate 
w m- The log-likelihood ratio of the SPRT in the arrow 
of time in steady state can be approximated by the cu¬ 
mulated work W exerted on the particle during switches 
C{t) = W{t)/kT. Here W{t) = where AVi is the 

potential energy change during the switching event i and 
the sum is done over all switches that occur before time 
t [^. Figure l^shows the estimate of L(1 — 2a)/(rdec) as 
a function of the reliability 1 —a. The plot shows that the 
SPRT in the arrow of time provides a lower bound for the 
steady state entropy production rate (blue open circles) 
and converges for high reliability to the correct value. In 
addition, Fig.|^shows the conditional distributions of the 
decision times revealing that the FTAT holds to good ap¬ 
proximation for high error probabilities despite that the 
propagator is not translationally invariant. 

When using the estimator given by Eq. (§ , which in¬ 
cludes the excess entropy production, the entropy pro- 


FIG. 4. Estimation of the entropy production rate using the 
SPRT in the arrow of time in a flashing ratchet model using 
the right hand side in Eq. Q (blue open circles) and the right 
hand side in Eq. § (blue open diamonds) as a function of the 
reliability of the test. Red open squares are given by the ratio 
between D[—>■ || <—] and the mean first-passage time (r) of the 
position of the particle in Eq. §. The results were obtained 
from 1000 numerical simulations with time step At = 1 fis, 
diffusion coefficient D — I fim^/ms, Vo = 10 kT, a = 1/3 fim, 
and uj — 10 kHz. The characteristic time = 0.07 ms is 
the numerical estimate of fe/(d5tot/dt) obtained from a single 
stationary trajectory of 10^ data points. Inset: Gorrection 
term in § given by [(AS'ex)dec/fc('rdec)] Tc as a function of 
1 — a (blue triangles). The solid line is a linear fit of the data. 




FIG. 5. Gonditional distributions of the decision time 
P(T"dec|—>) and P{Tdec\<—) obtained from 10® numerical sim¬ 
ulations of the flashing ratchet with the same simulation pa¬ 
rameters as in Fig. The two figures show the distributions 
for two different error probabilities a. 


duction rate is estimated more accurately at low reliabil¬ 
ities (Fig. 1^ blue diamonds). The inset in Fig. [^confirms 
that the correction term in Eq. (§ tends to zero for a 
small. Note that the estimator T(l —2Q!)/(Tdec) in Eq. ([^ 
provides a lower bound at small a because of the discon¬ 
tinuous jumps in the state variables. Using an heuristic 
estimator given by the ratio D[—t || where r is 

the first-passage time of the position of the particle, also 
bounds from below the steady state entropy production, 
as follows from Eq. (§ (Fig. HI red squares). 
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The dynamics of a stochastic nonequilibrium process 
provides evidence on the arrow of time to an observer. 
Reliable decisions on the direction of the arrow of time 
can be made measuring first-passage times of physical 
observables. When the physical observable used is the 
entropy production, the decision time is minimized. In 
addition, measuring first-passage times of physical ob¬ 
servables provides estimators for the steady state entropy 
production rate that are lower bounds to the true value. 
This follows from the optimality of the SPRT. Using this 
method to estimate entropy production, it is not nec¬ 
essary to sample the whole space of stochastic trajecto¬ 
ries as required in previous approaches pH - E!T| . Interest¬ 
ingly, our fluctuation theorem for the two-boundary first- 
passage time distribution of entropy production (10) im¬ 
plies that the shape of the distributions of decision times 
for correct and wrong decisions are equal even though 
the probabilities in both cases are different. The connec¬ 
tion between decision theory and thermodynamics pro¬ 
vided here could be of particular interest in the context 
of nonequilibrium processes that involve feedback con¬ 
trol, often found in biology and engineering. 
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SUPPLEMENTAL MATERIAL 

SI. MEAN DECISION TIME OF THE SPRT IN 

CONTINUOUS-TIME STATIONARY MARKOV 
PROCESSES 

The aim of this section is to show the derivation of 
Eq. (4) in the Main Text. We first review the SPRT for 
Markov chains (Sec. SIA) and the corresponding mean 
decision time formula obtained in Ref. [5] (Sec. SIB). Us¬ 
ing these results we then derive Eq. (4) in the Main Text 
relating the mean decision time to the rate of change of 
the log-likelihood ratio for the SPRT between continuous 
time Markov processes (Sec. SIC). Finally we provide a 
full derivation of Eq. ([30| to help the reader (Sec. SID). 


A. Sequential probability ratio test for stationary 
Markov chains 


We consider a discrete-time sequence A” = 
{xi,X 2 ,--- ,Xn) where the variables Xi are elements of 
a countable set of observations fl. The sequence is de¬ 
scribed by a stationary Markov process and therefore 
P(Af) = Pss(xi)P(x 2 lxi) ■ ■ ■ P(xnlxn-i), where PgsO) 
is the stationary distribution and P(xjlxi) is the transi¬ 
tion probability from Xi to Xj. Based on the sequence 
A”, we consider two competing hypotheses Jig ^-nd Hi 
for which P(X^lHo) = Pss(Xi)Po(X 2 lxi) ■ ■ ■ Po(XnlXn-l) 
and P(Af|Pi) = Pss(a;i)Pi(a: 2 |a;i) • • • Pi(a:„|a;„_i), re¬ 
spectively. Here we consider the case where the station¬ 
ary distribution is the same under both hypotheses. The 
sequential probability ratio test (SPRT) between Hg and 
Hi follows the evolution in time of the following log- 
likelihood ratio Cn 


P{X^\Hi 

P{X^\Hg) 


|4in Pi(x.+i|a:0 

^ Pg[X^+i\x^) 


(14) 

(15) 


When Lg < Cn < Pi the test continues and a new obser¬ 
vation Xn+i is taken. When £„ > Pi the test terminates 
and the hypothesis Hi is accepted and when £„ < Lg 
the test terminates and the hypothesis Hg is accepted. 
The threshold values are approximatively given by 


(^) ' 


with ag the allowed error probability to decide for Hi, 
when the hypothesis Hg is true, and ai the allowed error 
probability to decide for Hg, when the hypothesis Pi is 
true. For a continuous-time sequence of observations the 


threshold values are exact when the log-likelihood ratio 
is a continuous function in time |3] . 

One can equivalently write the transition matrices in 
Dirac’s bra-ket notation as follows: P(xjjxi) = (j|P|z), 
Pg{xj\xi) = (j|Po|*) and Pi{xj\xi) = (j|Pi|*). Similarly, 
for the stationary distribution Pss{xi) = (i|pss)- 


B. Mean number of observations in the SPRT for 
Markov chains 


Given a sequence of observations, ridec is defined as the 
number of observations taken until the test terminates. 
The mean number of observations E[ndec] to make a de¬ 
cision (for either Hg or Hi) over all possible sequences 
is, following Phatarfod’s derivation, given by |9] 

E[ndec] = --(.18) 

Note that in the corresponding equation (3.10) in Ref. [3] 
there is a typo and the sign of the last two terms in 
the numerator are swapped. Here E denotes an average 
over all possible infinitely long sequence^ The first term 
in the numerator is the expected value of the 

log-likelihood ratio when the decision is taken. The rest 
of the terms can be defined in terms of the following 
generating matrix. 





/ (^|Pib') Y 

l(*|Po|j)/ 


,(19) 


where z S K. The matrix P(z) has eigenvalues Xi{z) with 
corresponding right eigenvectors \ri{z)) and left eigen¬ 
vectors {li{z)\. Here Ai(0) = 1 is the Perron root of 
the transition matrix P and Ai(z) is the corresponding 
eigenvalue of P(z). Thus |ri(z)) is the right eigenvector 
of Ai(z) and (Zi(z)| is the corresponding left eigenvector. 
In the limit of z —>• 0, |ri(0)) = \pss) and (/i(0)| = (1| 
for which (l|f) = 1 for all f € fl. The denominator in 
Eq. ([^1 equals A((0) = lim^^o sAi(z) and (/((O)] = 
lim^^o remaining term E [(ri(0)|a:„j^J] 

contains the ensemble average of the right eigenvector 
l^ridec) corresponding to the value of the observation a;„dec 
when the decision is taken: {j\xnd^c) — , where S 

denotes the Kronecker delta. 


C. Mean decision time formula in the continuous 
time limit 


We first revisit matrix perturbation theory and then 
apply it to determine the different terms in Phatarfod’s 
mean decision time formula given by Eq. (181. 


^ The expectation E[.] corresponds to the ensemble average {■) in 
the Main Text. This is because we use the bra-ket notation for 
vectors and matrices. 
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1. Revision of matrix perturbation theory 


We will use some results from perturbation theory, see 
Chapter 2 of |23]. For two matrices A and B which satisfy 
the relations |(t|A|j)| < 1 and |(i|B|j)| < 1 we consider 
the perturbed matrix C(z) 

C(z) = A + zB , (20) 


with z € K the perturbation parameter. The simple 
eigenvalues Afe(z) of C(z), and their corresponding eigen¬ 
vectors |rfc(z)) and (^fe(z:)| are given by a convergent 
power series in the perturbation parameter z 

Afc(z) = A/c(0) -I- z A5.(0) -I- — A)'(O) H- (21) 

\rk{z)) = |r,( 0 )) + z |rU 0 )) + ^ |r^'( 0 )) + • • • ( 22 ) 

(Z,(z)| = (Zfc(0)|+z(ZU0)| + y (Z;'(0)| + --- (23) 

for sufficient small values of |z|. The first order terms of 
the power series are given by |24| 


Km 

mm) 

mm\ 


(^fc(0)|B|rfc(0)) 

(/fe( 0 )|rfc( 0 )) 

ki( 0 ))(^»( 0 )|B|rfc( 0 )) 

{Kio)-K{o))(iimmm) 

^ (4(0)|B|r,(0))(Z,(0)| 

^emk} imrn - mo)){hm\nm) 


(24) 

(25) 

(26) 


2. Calculation of the mean decision time 


The expansion of the exponential in Eq. (191 for z small 
yields 


P{z) = P + zP' + Om) . 
with P the transition probability matrix and 

(*|Pi|j) 


P' = H l*)(*|P|7)(j| In 


{^\Po\J) 


(27) 


(28) 


Identifying in Eqs. (24l and (26l C(z) as P(z:), A as P 

(*iPibr 


and B as P', we obtain 


^i(o) = (*|P|7)(jbss) In 

We further have (see Appendix) 

i,jGQ 


IPoIj) 


(»|Pi|j) 

(*|Po|j) 


(29) 


J2m\Pn) - UIPbs)) 


n—0 


(30) 


with \pn) = P”|p) with \p) a distribution on phase space 
or in other words a normalized right eigenvector with 
(l|p) = 1. For Ip) = Ipss), we thus have (Z((0)|pss) = 
0 , and therefore the second term in the numerator in 


Eq. (18) vanishes. 


Using the results shown above we can rewrite Phatar- 
fod’s mean decision time formula (ITsl as 


^ [n-dec 


7-^0 - 




' (1 ~ l)Ll + Edec[-^ex] 
m\j){j\Pss) ln({^^) 


(31) 


where 7 is the probability to hit Lq. For deriving Eq. (311 


we have used that the log-likelihood ratio at the decision 
reaches exactly one of the two values Lq or Li, 
and therefore = jLq -f (1 — 7 )^ 1 . We have also 

defined the expected value of the excess log-likelihood 
ratio starting from the state at the decision time 

Edec[£ex]=^E(^|P|j')l^(|f^ 


E( 0 '|P”l^^dec)-(jb.s)) 


n =0 


(32) 


with pdec = E[sendee]- It I® now possible to take the con¬ 


tinuous time limit of the log-likelihood ratio in Eq. (151 


which yields for Eq. (31) 


E[rdec] = 


JLq -|- (1 — 7 )Li -|- Edec[£ex] 

E[d£/dt] 


(33) 


where dC{t)/dt = limAi-).o(l/At)[£(t -f At) — C{t)]. For 
symmetric error probabilities, ao = ai = a, Lq = —Li = 
—L and therefore 7 L 0 + (1 ~ l)Li = L(\ — 2a). In this 


case, Eq. (33) corresponds to Eq. (4) in the Main Text. 


D. Derivation of Eq. (30) 


Let us rewrite Eq. (26) identifying C(z) as P(z), A as 
P and B as P', 


/U /n\\ \ (l|P1u)(^ib) 


= (j|P|TO)ln 

m,j^Q 


(j|Pi|nt) 

{j\Po\m) 


i\Qp) , 


(34) 


where we have defined the matrix Q as 

<3= U - (35) 

and for simplicity we have used the notation Ai = Ai(0), 
Vi) = |?'i(0)) and {li\ = (£(0)|. Since P is a transition 






























probability matrix, its eigenvalues i ^ 1 satisfy |Ai| < 1, 
and therefore we can write 


(i-A.)-^ = ^Ar 


m—Q 


As a result, 


(36) 


which thus is: 


u)A™((i 

m=0iGn\{l} 

(37) 

00 

m—0 

(38) 

00 

m—O 

(39) 


Substituting Eq. (391 in (341 yields Eq. (301. 


S2. FLUCTUATION THEOREMS FOR 
FIRST-PASSAGE TIME DISTRIBUTIONS 


Here we derive the fluctuation theorem for the distri¬ 
bution of the first passage times of entropy production 
given by Eq. (10) in the Main Text. 

For a translationally invariant and steady state Marko¬ 
vian process X, the transition probability P{x\x' 
denotes the probability density of X{t) given that 
X{t') = x'. The transition probability satisfies 

P{x\x' -,1,1') = P{x — x'|0;f — t',0) = P{x — x';t — t'). 
Let us assume that the transition probability fulfils a 
standard fluctuation relation 


P{-x;t) 


(40) 


where ?7 is a positive real number. We now define three 
different first-passage time distributions: 


1. The probability 4>(t; x)dt that X passes for the first 
time through a single boundary located at a relative 
coordinate x in the time interval [t, t -|- dt]. 

2. The probability <i)(t; x)dt that X passes for the first 
time X in the time interval [t,t + dt] without hav¬ 
ing reached —x before. Analogously, 4>(t; —a:)df is 
the probability that X reaches —x for the first time 
without having reached a barrier located at x be¬ 
fore. 


3. The probability n(t; x)dt that X passes for the first 
time through x in the time interval [t, t -|- dt] after 
having crossed —x at least once before. 


The event, that X reaches unconditionally the single bar¬ 
rier at X equals the probability of the two disjoint events 


that X has crossed the boundary at —x or has never 
crossed the boundary at —x: 


4>(t; a:) = 4>(t; x)-|-n(t; x) . 


(41) 


We first derive a fluctuation relation for the one¬ 
boundary first-passage time distribution 'k(t;x). 4'(t;x) 
satisfies the following fluctuation relation 


'i’{t;x) 

4'(t;-x) 


(42) 


As we will show in the following, Eq. (42) follows from 


the relationship between the transition probability and 
the one boundary first-passage time distribution |25| . 


P{x;t) = f dt''d){t';x)P{0;t— t') , 

Jo 


(43) 


and the Laplace transform of the transition probability 
equals to 

P(x; s) = 4'(s; x)P(0; s) . (44) 

Here we have introduced the Laplace transforms 


P(x; s)= dt e 

Jo 

poo 

^(s; x) = / dte 

Jo 


-^‘P(x;t) 
-"‘«'(t;x) 


(45) 

(46) 


Since the fluctuation relation is independent of time, the 
Laplace transform of the transition probability satisfies 

= e’'" (47) 

P(-x;s) 

which together with the factorization property given by 


Eq. (|44| yields a fluctuation theorem for the Laplace 

(48) 


transform of the first-passage time distribution 4': 
^(s;x) 


4/(5; —x) 


which implies the fluctuation relation (421. 


We now derive the fluctuation relation for the first- 
passage time distribution 4)(t; x) of the two boundary 
scenario, given by Eq. (9) in the Main Text. 

The probability that the process X reaches the bound¬ 
ary at X having crossed — x at least once before equals 
the probability that X reaches —x at t' and then reaches 
unconditionally the barrier at x in the time interval t — t' 
summed over all mutually exclusive events over t — t': 


n{t;x) = f dt'4)(t'; —x)4'(t — t'; 2x) . 

Jo 


(49) 


Using Eq. (41) and Eq. (49), 4'(f;x) equals to 
4'(f; x) = 4>(t; x) -|- fl{t; x) 

= 4>(t;x)-|- f df'4>(t'; —x)4>(f — t'; 2x) . 

Jo 















9 


Taking Laplace transforms in both sides we obtain 


Laplace transform of the first-passage distribution $: 


T(s; a;) = $(s; x)-I-$(s; —2a;) . 

We thus have that 


^(s;—rc) 


^(s; 2a;) 


^'(s; a;) 

^is;-x) 1 + J(£i£L^(s;_2x) 


(50) 


(51) 


Solving for the ratio $(s; a;)/$(s; —a;) and using the fluc¬ 


tuation theorem in 'I' given by Eq. (481 we obtain 

$(s;a;) — 4'(s; 2a;) 

$(s; —a;) 1 — e''“^(s; —2a;) 

^ / l-e-’^^^(s;2a;) \ 

I 1 - e^^^{s;-2x) j 


(52) 


Equation (521 expresses a fluctuation theorem for the 


l>(s; x) 
l>(s; —x) 

which yields equivalently 

$(t;x) 


(53) 


4>(t;-a;) 


(54) 


Equation (541 corresponds to Eq. (9) in the Main Text 
for X = AS'tot and rj = 1/k. 


Equation (541 holds for processes where the transition 


probability is translationally invariant and satisfies the 


fluctuation theorem given by (401. Fluctuation theorem 


(13) with X = AS'tot for the fluctuations of entropy pro¬ 
duction is known to hold for a wide class of processes 
including stochastic diffusive processes [MU], chaotic 
systems m and sheared fluids |10| . 
















